Goto

Collaborating Authors

 legal act


The Russian Legislative Corpus

Saveliev, Denis, Kuchakov, Ruslan

arXiv.org Artificial Intelligence

A comprehensive and up-to-date collection of Russian legislation considered a'gold standard' does not exist. From a legal perspective, the collapse of the Soviet Union in the 1990s and the new Russian statehood became a starting point for collecting legal documents. While not every Soviet-era legal act was repealed, most had been completely abolished. We collect all federal regulations covering 1991 to 2023 and prepared these texts for linguistic analysis. We discuss the place of the present corpus among the related corpora (Section 2). We also provide a brief institutional context of the Russian promulgation routine (Section 3) and descriptive statistics (Section 4) and describe our processing pipeline (Sections 5, 6, 7).

  Country:
  Genre: Research Report (0.40)
  Industry:

CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering

Wiratunga, Nirmalie, Abeyratne, Ramitha, Jayawardena, Lasal, Martin, Kyle, Massie, Stewart, Nkisi-Orji, Ikechukwu, Weerasinghe, Ruvan, Liret, Anne, Fleisch, Bruno

arXiv.org Artificial Intelligence

Retrieval-Augmented Generation (RAG) enhances Large Language Model (LLM) output by providing prior knowledge as context to input. This is beneficial for knowledge-intensive and expert reliant tasks, including legal question-answering, which require evidence to validate generated text outputs. We highlight that Case-Based Reasoning (CBR) presents key opportunities to structure retrieval as part of the RAG process in an LLM. We introduce CBR-RAG, where CBR cycle's initial retrieval stage, its indexing vocabulary, and similarity knowledge containers are used to enhance LLM queries with contextually relevant cases. This integration augments the original LLM query, providing a richer prompt. We present an evaluation of CBR-RAG, and examine different representations (i.e. general and domain-specific embeddings) and methods of comparison (i.e. inter, intra and hybrid similarity) on the task of legal question-answering. Our results indicate that the context provided by CBR's case reuse enforces similarity between relevant components of the questions and the evidence base leading to significant improvements in the quality of generated answers.


LexDrafter: Terminology Drafting for Legislative Documents using Retrieval Augmented Generation

Chouhan, Ashish, Gertz, Michael

arXiv.org Artificial Intelligence

With the increase in legislative documents at the EU, the number of new terms and their definitions is increasing as well. As per the Joint Practical Guide of the European Parliament, the Council and the Commission, terms used in legal documents shall be consistent, and identical concepts shall be expressed without departing from their meaning in ordinary, legal, or technical language. Thus, while drafting a new legislative document, having a framework that provides insights about existing definitions and helps define new terms based on a document's context will support such harmonized legal definitions across different regulations and thus avoid ambiguities. In this paper, we present LexDrafter, a framework that assists in drafting Definitions articles for legislative documents using retrieval augmented generation (RAG) and existing term definitions present in different legislative documents. For this, definition elements are built by extracting definitions from existing documents. Using definition elements and RAG, a Definitions article can be suggested on demand for a legislative document that is being drafted. We demonstrate and evaluate the functionality of LexDrafter using a collection of EU documents from the energy domain.


Entity Graph Extraction from Legal Acts -- a Prototype for a Use Case in Policy Design Analysis

Wróblewska, Anna, Pieliński, Bartosz, Seweryn, Karolina, Saputa, Karol, Wichrowska, Aleksandra, Sysko-Romańczuk, Sylwia, Schreiber, Hanna

arXiv.org Artificial Intelligence

This paper presents research on a prototype developed to serve the quantitative study of public policy design. This sub-discipline of political science focuses on identifying actors, relations between them, and tools at their disposal in health, environmental, economic, and other policies. Our system aims to automate the process of gathering legal documents, annotating them with Institutional Grammar, and using hypergraphs to analyse inter-relations between crucial entities. Our system is tested against the UNESCO Convention for the Safeguarding of the Intangible Cultural Heritage from 2003, a legal document regulating essential aspects of international relations securing cultural heritage.